Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees
نویسندگان
چکیده
Metrics of phylogenetic tree reliability, such as parametric bootstrap percentages or Bayesian posterior probabilities, represent internal measures of the topological reproducibility of a phylogenetic tree, while the recently introduced aLRT (approximate likelihood ratio test) assesses the likelihood that a branch exists on a maximum-likelihood tree. Although those values are often equated with phylogenetic tree accuracy, they do not necessarily estimate how well a reconstructed phylogeny represents cladistic relationships that actually exist in nature. The authors have therefore attempted to quantify how well bootstrap percentages, posterior probabilities, and aLRT measures reflect the probability that a deduced phylogenetic clade is present in a known phylogeny. The authors simulated the evolution of bacterial genes of varying lengths under biologically realistic conditions, and reconstructed those known phylogenies using both maximum likelihood and Bayesian methods. Then, they measured how frequently clades in the reconstructed trees exhibiting particular bootstrap percentages, aLRT values, or posterior probabilities were found in the true trees. The authors have observed that none of these values correlate with the probability that a given clade is present in the known phylogeny. The major conclusion is that none of the measures provide any information about the likelihood that an individual clade actually exists. It is also found that the mean of all clade support values on a tree closely reflects the average proportion of all clades that have been assigned correctly, and is thus a good representation of the overall accuracy of a phylogenetic tree.
منابع مشابه
Retraction: Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees
As a result of a bug in the Perl script used to compare estimated trees with true trees, the clade confidence measures were sometimes associated with the incorrect clades. The error was detected by the sharp eye of Professor Sarah P. Otto of the University of British Columbia. She noticed a discrepancy between the example tree in Figure 1B and the results reported for the gene nuoK in Table 1, ...
متن کاملPhylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf
Phylogenetic relationships among all described species (total of 5 taxa) of the shrimp genus Penaeus, were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp in the Iranian coastal waters. The reconstruction of the evolution phylogeny of these species is crucial in revealing stock identity that ca...
متن کاملPhylogenetic relationships of the commercial marine shrimp family Penaeidae from Persian Gulf
Phylogenetic relationships among all described species (total of 5 taxa) of the shrimp genus Penaeus, were examined with nucleotide sequence data from portions of mitochondrial gene and cytochrome oxidase subunit I (COI). There are twelve commercial shrimp in the Iranian coastal waters. The reconstruction of the evolution phylogeny of these species is crucial in revealing stock identity that ca...
متن کاملOn the reliability of Bayesian posterior clade probabilities in phylogenetic analysis
This article discusses possible reasons why posterior clade probabilities obtained from Bayesian phylogenetic analyses might be inaccurate. It attempts to list all possible sources of uncertainty and error in Bayesian phylogenetic analysis. The choice of priors on trees has been suggested by several authors as a cause of inaccurate posterior clade probabilities. I argue strongly for using prior...
متن کاملMolecular Phylogeny of the Genus Lathyrus (Fabaceae-Fabeae) Based on cpDNA matK Sequence in Iran
Background: More than 60 species of the genus Lathyrus are distributed in Southwest Asia. It is the second largest genus of the tribe Fabeae, after Vicia, in the region (and in Iran with 23 species). In the regional Flora (Flora of Turkey, FloraIranicaand flora...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PLoS Computational Biology
دوره 3 شماره
صفحات -
تاریخ انتشار 2007